Scalability of Piecewise Synonym Identification in Integration of SNOMED into the UMLS

Authors

  • Kuo-Chuan Huang
  • James Geller
  • Michael Halper
  • Gai Elhanan
  • Yehoshua Perl
Abstract

Synonym identification during the integration of a source terminology into the Unified Medical Language System (UMLS) is a labor-intensive task that must be repeated for every new release of the source. The piecewise synonym (PWS) methodology was previously used to integrate a small source. The goal of this paper is to determine whether the PWS methodology with two control parameters scales to a much larger terminology (a subset of SNOMED CT), whether the control parameters are necessary to make the methodology viable, and whether they lead to any loss of matching results. Additional methods are used to limit the size of the dictionary used in PWS generation. The authors’ methodology discovered 41% of concepts not found by string matching. The necessity and effectiveness of the control parameters were confirmed. Furthermore, a comparison of experiments with and without control parameters showed that no matches were lost.

Yehoshua Perl, New Jersey Institute of Technology, USA. DOI: 10.4018/978-1-4666-2653-9.ch011
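
The general idea of matching by piecewise synonyms can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say what the two control parameters are, so the two caps used here (`max_per_word` and `max_candidates`), the function names, and the toy synonym dictionary are all hypothetical choices made only for illustration.

```python
from itertools import islice, product


def piecewise_candidates(term, synonyms, max_per_word=3, max_candidates=50):
    """Build candidate strings by swapping individual words for synonyms.

    max_per_word and max_candidates are illustrative caps (assumptions),
    not the control parameters defined in the paper.
    """
    words = term.lower().split()
    # Each word may stay as-is or be replaced; cap the alternatives per word.
    options = [[w] + sorted(synonyms.get(w, ()))[:max_per_word] for w in words]
    # product() is lazy, so islice() stops generation once the cap is reached.
    combos = islice(product(*options), max_candidates + 1)
    candidates = [" ".join(c) for c in combos]
    return candidates[1:]  # the first combination is the original term


def match_term(term, existing_terms, synonyms, **limits):
    """Try plain string matching first, then piecewise synonym candidates."""
    normalized = {t.lower(): t for t in existing_terms}
    if term.lower() in normalized:
        return normalized[term.lower()], "string match"
    for candidate in piecewise_candidates(term, synonyms, **limits):
        if candidate in normalized:
            return normalized[candidate], "piecewise synonym match"
    return None, "no match"


# Toy data, invented for this example.
synonyms = {"kidney": {"renal"}, "stone": {"calculus"}}
existing = ["Renal calculus", "Heart attack"]

print(match_term("Kidney stone", existing, synonyms))
# ('Renal calculus', 'piecewise synonym match')
```

In this toy setting, calling `match_term("Kidney stone", existing, synonyms, max_candidates=2)` finds no match, because the cap stops generation before "renal calculus" is produced. That is the kind of loss the paper checks for when it compares runs with and without control parameters.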

Related resources

Piecewise Synonyms for Enhanced UMLS Source Terminology Integration

The UMLS contains more than 100 source vocabularies and is growing via the integration of others. When integrating a new source, the source terms already in the UMLS must first be found. The easiest approach to this is simple string matching. However, string matching usually does not find all concepts that should be found. A new methodology, based on the notion of piecewise synonyms, for enhanc...

Determining correspondences between high-frequency MedDRA concepts and SNOMED: a case study

BACKGROUND: The Systematized Nomenclature of Medicine Clinical Terms (SNOMED CT) is being advocated as the foundation for encoding clinical documentation. While the electronic medical record is likely to play a critical role in pharmacovigilance (the detection of adverse events due to medications), classification and reporting of adverse events is currently based on the Medical Dictionary of Regu...

Using WordNet synonym substitution to enhance UMLS source integration

OBJECTIVE: Synonym-substitution algorithms have been developed for the purpose of matching source vocabulary terms with existing Unified Medical Language System (UMLS) terms during the integration process. A drawback is the possible explosion in the number of newly generated (potential) synonyms, which can tax computational and expert review resources. Experiments are run using a synonym-substit...

Characterizing the semantic composition of the UMLS Metathesaurus over time

Motivation. The UMLS Metathesaurus has grown dramatically over the past fifteen years. From 2002 to 2015, the number of concepts has increased from about 777,000 to 3.1 million, a 4-fold increase. It is difficult to infer the semantic composition of the UMLS from the list of its sources. While some source vocabularies contribute concepts from a single semantic category (e.g., anatomical entitie...

Assisting the Translation of SNOMED CT into French

The objective of this study is to evaluate two approaches assisting the translation of SNOMED CT into French. Two types of approaches were combined: a concept-based one, which relies on conceptual information of the UMLS Metathesaurus, and a lexical-based one, which relies on NLP techniques. In addition to the French terminologies (whether included in UMLS or not). Using the concept-based approach...

Journal title:
  • IJCMAM

Volume 2

Publication date: 2011